Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: better ingestion timestamp warning #27122

Open
wants to merge 5 commits into
base: master
Choose a base branch
from

Conversation

pauldambra
Copy link
Member

we reject replay messages if the first_timestamp is invalid

but in some cases the last_timestamp is valid

we see this consistently (but at least at a very very low rate). if you have a very short session we might be dropping the only message we see for that session.

i'd like to capture a custom event into the recording instead of totally dropping the payload or we have no way of diagnosing what is happening

i want to capture a little more information since from what i have i can't quite figure out the safe way to capture the event

NB I won't merge this until 6th Jan regardless, since i'm not really working

@@ -148,6 +153,8 @@ export class ReplayEventsIngester {
isValid: asDate.isValid,
daysFromNow: Math.round(Math.abs(asDate.diffNow('day').days)),
processingTimestamp: DateTime.now().toISO(),
eventTypes,
customEvents,
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

these all have small controlled payloads we create so are safe to capture here

Comment on lines 142 to 143
type: event.type,
timestamp: event.timestamp,
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this will hopefully let me spot any patterns in the bad data, and figure out how I can safely capture a custom event instead of dropping here

@@ -138,6 +138,11 @@ export class ReplayEventsIngester {
if (replayRecord !== null) {
const asDate = DateTime.fromSQL(replayRecord.first_timestamp)
if (!asDate.isValid || Math.abs(asDate.diffNow('day').days) >= 7) {
const eventTypes = rrwebEvents.map((event) => ({
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Two loops over what could potentially be arrays with 1000s of entries? Given how sensitive blobby is to cpu work, maybe we could:

  1. Do this in one loop rather than a map and filter
  2. Wait till after christmas to merge

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

  1. fair
  2. definitely!

@posthog-bot
Copy link
Contributor

This PR hasn't seen activity in a week! Should it be merged, closed, or further worked on? If you want to keep it open, post a comment or remove the stale label – otherwise this will be closed in another week. If you want to permanentely keep it open, use the waiting label.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants